Comparison of Data Models for Unsupervised Twitter Sentiment Analysis

نویسندگان

چکیده

"Identifying the sentiment of collected tweets has become a challenging and interesting task. In addition, mining defining relevant features that can improve quality classification system is crucial. The data modeling phase fundamental for whole process since it reveal hidden information from textual inputs. Two models are defined in presented paper, considering Twitter-specific concepts: hashtagbased representation text-based one. These will be compared integrated into an unsupervised determines groups based on labels (positive negative). Moreover, wordembedding techniques (TF-IDF frequency vectors) used to convert representations numeric input needed clustering methods. experimental results show good values Silhouette Davies-Bouldin measures environment. A detailed investigation several items (dataset, method, representation, or word embeddings) checking best setup increasing detecting Twitter’s messages. analysis conclusions first considered more complex experiments. Keywords: Sentiment Analysis, Twitter, Data Representation, Hashtags, Clustering. "

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SSA-UO: Unsupervised Sentiment Analysis in Twitter

This paper describes the specifications and results of SSA-UO, unsupervised system, presented in SemEval 2013 for Sentiment Analysis in Twitter (Task 2) (Wilson et al., 2013). The proposal system includes three phases: data preprocessing, contextual word polarity detection and message classification. The preprocessing phase comprises treatment of emoticon, slang terms, lemmatization and POS-tag...

متن کامل

Sentiment Analysis of Twitter Data

We examine sentiment analysis on Twitter data. The contributions of this paper are: (1) We introduce POS-specific prior polarity features. (2) We explore the use of a tree kernel to obviate the need for tedious feature engineering. The new features (in conjunction with previously proposed features) and the tree kernel perform approximately at the same level, both outperforming the state-of-the-...

متن کامل

Sentiment analysis of twitter data

Social networks are the main resources to gather information about people’s opinion and sentiments towards different topics as they spend hours daily on social medias and share their opinion. In this technical paper, we show the application of sentimental analysis and how to connect to Twitter and run sentimental analysis queries. We run experiments on different queries from politics to humanit...

متن کامل

2016 Olympic Games on Twitter: Sentiment Analysis of Sports Fans Tweets using Big Data Framework

Big data analytics is one of the most important subjects in computer science. Today, due to the increasing expansion of Web technology, a large amount of data is available to researchers. Extracting information from these data is one of the requirements for many organizations and business centers. In recent years, the massive amount of Twitter's social networking data has become a platform for ...

متن کامل

GTI: An Unsupervised Approach for Sentiment Analysis in Twitter

This paper presents the approach of the GTI Research Group to SemEval-2015 task 10 on Sentiment Analysis in Twitter, or more specifically, subtasks A (Contextual Polarity Disambiguation) and B (Message Polarity Classification). We followed an unsupervised dependency parsing-based approach using a sentiment lexicon, created by means of an automatic polarity expansion algorithm and Natural Langua...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Studia Universitatis Babes-Bolyai: Series Informatica

سال: 2023

ISSN: ['2065-9601', '1224-869X']

DOI: https://doi.org/10.24193/subbi.2022.2.05